Abstract — A variety of low-density parity-check (LDPC) ensembles have now been observed to approach capacity with
message-passing decoding. However, all of them use soft (i.e., non- 
binary) messages and a posteriori probability (APP) decoding 
of their component codes. In this paper, we analyze a class of 
spatially-coupled generalized LDPC codes and observe that, in 
the high-rate regime, they can approach capacity under iterative 
hard-decision decoding. These codes can be seen as generalized 
product codes and are closely related to braided block codes. 

Index Terms — GLDPC codes, density evolution, product codes, 
braided codes, syndrome decoding 

I. Introduction 

In his groundbreaking 1948 paper, Shannon defined the 
capacity of a noisy channel as the largest information rate 
for which reliable communication is possible [1]. Since then,
researchers have spent countless hours looking for ways to 
achieve this rate in practical systems. In the 1990s, the 
problem was essentially solved by the introduction of iterative 
soft decoding for turbo and low-density parity-check (LDPC) 
codes [2], [3], [4]. Although their decoding complexity is
significant, these new codes were adopted quickly in wireless 
communication systems where the data rates were not too 
large [5], [6]. In contrast, complexity issues have slowed their
adoption in very high-speed systems, such as those used in 
optical and wireline communication. 

In this paper, we consider an ensemble of spatially-coupled 
generalized LDPC (GLDPC) codes based on t-error correcting
block codes. For the binary symmetric channel (BSC), we 
show that the redundancy-threshold tradeoff of this ensemble, 
under iterative hard-decision decoding, scales optimally in the 
high-rate regime. To the best of our knowledge, this is the first 
example of an iterative hard-decision decoding (HDD) sys- 
tem that can approach capacity. Extrinsic information (EXIT) 
curves are also used to observe the phenomenon of threshold 
saturation for this decoder. It is interesting to note that iterative 
HDD of product codes was first proposed well before the 
recent revolution in iterative decoding but the performance 
gains were limited [7]. Iterative decoding of product codes 
became competitive only after the advent of iterative soft 
decoding based on the Turbo principle [8].

Our choice of ensemble is motivated by variations of product codes now used in optical communications [9]. These, in turn, are also closely related to braided block codes [10], [11]. Our focus on HDD of generalized product codes based on t-error correcting block codes is similar to recent work by Justesen, who has been exploring a variety of coding systems for optical communication systems [12], [13]. The main difference is that the spatially-coupled GLDPC ensemble can be analyzed via density evolution (DE), which allows one to rigorously account for spatial coupling and miscorrection. From DE, one can observe that iterative hard-decision decoding (HDD) can approach capacity.

II. Basic Ensemble 

Let $\mathcal{C}$ be an $(n, k, d)$ binary linear code that can correct all error patterns of weight at most $t$ (i.e., $d \ge 2t + 1$). For example, one might choose $\mathcal{C}$ to be a primitive BCH code with parameters $(2^\nu - 1, 2^\nu - \nu t - 2, 2t + 2)$. Now, we consider a GLDPC ensemble where every bit node satisfies two code constraints defined by $\mathcal{C}$.

Definition 1. The $(\mathcal{C}, m)$ GLDPC ensemble is defined by a Tanner graph, denoted by $\mathcal{G} = (\mathcal{I} \cup \mathcal{J}, \mathcal{E})$, with a set $\mathcal{I}$ of $N = \frac{mn}{2}$ degree-2 bit nodes and a set $\mathcal{J}$ of $m$ degree-$n$ code-constraint nodes defined by $\mathcal{C}$. A uniform random permutation is used to connect the $mn$ edges from the bit nodes to the constraint nodes.
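
As a minimal sketch of Definition 1, one can draw a graph from the ensemble by permuting the $mn$ bit-node sockets and handing them out to the constraints in blocks of $n$. The function name `sample_gldpc_graph` and the list-of-lists representation are illustrative assumptions and are not taken from the paper.

```python
import random

def sample_gldpc_graph(n, m, seed=None):
    """Sample a (C, m) GLDPC Tanner graph.

    Returns a list `constraints` where constraints[j] holds the n bit-node
    indices attached to the sockets of constraint node j.
    """
    if (m * n) % 2:
        raise ValueError("m*n must be even so that N = m*n/2 is an integer")
    num_bits = m * n // 2
    # Every degree-2 bit node contributes two edge sockets.
    bit_sockets = [b for b in range(num_bits) for _ in range(2)]
    rng = random.Random(seed)
    rng.shuffle(bit_sockets)  # uniform random permutation of the mn edges
    # Constraint j receives the next n sockets of the permuted list.
    return [bit_sockets[j * n:(j + 1) * n] for j in range(m)]

constraints = sample_gldpc_graph(n=7, m=4, seed=1)
```

A sampled graph may connect one bit to the same constraint twice. The analysis in the text assumes, for simplicity, that this does not happen, so a practical construction would resample or repair such graphs.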

A. Decoder 

It is well-known that GLDPC codes perform well under 
iterative soft decoding [8]. The main drawback is that a 
posteriori probability (APP) decoding of the component codes 
can be very complex. For this reason, we consider message- 
passing decoding based on bounded-distance decoding (BDD) 
of the component codes. Let $d_H(\cdot, \cdot)$ be the Hamming distance and $D_i : \{0,1\}^n \to \{0,1\}$ be the bit-level mapping implied by BDD, which is given by

\[ D_i(v) = \begin{cases} c_i & \text{if } c \in \mathcal{C} \text{ satisfies } d_H(c, v) \le t \\ v_i & \text{if } d_H(c, v) > t \text{ for all } c \in \mathcal{C}. \end{cases} \]

More generally, our analysis holds for symmetric decoders where the mapping $D_i(\cdot)$ satisfies $D_i(v \oplus c) = D_i(v) \oplus c_i$ for all $c \in \mathcal{C}$ and $i = 1, \ldots, n$. For example, all syndrome decoders have this property.
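
The BDD mapping and the symmetry property can be checked exhaustively on a toy code. The sketch below is only an illustration: it uses the even-weight subcode of the (7,4) Hamming code, a (7,3,4) code with $t = 1$, as a stand-in component code, and it decodes by brute-force search over the codebook. The helper names are hypothetical.

```python
from itertools import product

def hamming_distance(a, b):
    return sum(x != y for x, y in zip(a, b))

def bdd(v, codebook, t):
    """Bit-level BDD map: returns (D_1(v), ..., D_n(v)) as a tuple."""
    for c in codebook:
        if hamming_distance(c, v) <= t:
            return tuple(c)   # unique whenever d >= 2t + 1
    return tuple(v)           # decoding failure: pass the input through

# Parity-check matrix of the (7,4) Hamming code; column j-1 is j in binary.
H = [[(j >> b) & 1 for j in range(1, 8)] for b in range(3)]

def in_hamming(c):
    return all(sum(h * x for h, x in zip(row, c)) % 2 == 0 for row in H)

# Illustrative component code: even-weight Hamming codewords only.
codebook = [c for c in product((0, 1), repeat=7)
            if in_hamming(c) and sum(c) % 2 == 0]

def xor(a, b):
    return tuple(x ^ y for x, y in zip(a, b))

# Symmetry: D(v xor c) == D(v) xor c for every word v and codeword c.
symmetric = all(
    bdd(xor(v, c), codebook, 1) == xor(bdd(v, codebook, 1), c)
    for v in product((0, 1), repeat=7) for c in codebook
)
```

Because this toy code is not perfect, some inputs (for example, any word of weight 2) lie farther than $t$ from every codeword and are returned unchanged, which exercises the second branch of the mapping.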

Decoding proceeds by passing binary messages along the edges connecting the variable and constraint nodes. Let $r_i \in \{0, 1\}$ denote the received channel value for variable node $i$ and $\nu_{i,j}^{(\ell)}$ be the binary message from the $i$-th variable node to the $j$-th constraint node in the $\ell$-th iteration. For simplicity, we assume no bit appears twice in a constraint and let $\sigma_j(k)$ be the index of the variable node connected to the $k$-th socket of the $j$-th constraint. Let $j'$ be the other neighbor of the $i$-th variable node. Suppose that $\sigma_j(k) = i$. Then, the iterative decoder is defined by the recursion

\[ \nu_{i,j'}^{(\ell+1)} = D_k\left(v_{i,j}^{(\ell)}\right), \tag{1} \]

where, for the $i$-th variable node and the $j$-th constraint node, the candidate decoding vector is $v_{i,j}^{(\ell)} = \left(\nu_{\sigma_j(1),j}^{(\ell)}, \nu_{\sigma_j(2),j}^{(\ell)}, \ldots, \nu_{\sigma_j(n),j}^{(\ell)}\right)$ except that the $k$-th entry is replaced by $r_i$.

B. Density Evolution 

The iterative decoding performance of GLDPC codes can be 
analyzed via density evolution (DE) because, for a randomly 
chosen bit node, any fixed-depth neighborhood in the Tanner 
graph is a tree with high probability as $m \to \infty$. For HDD
of the component codes, this DE can be written as a one- 
dimensional recursion. 

If we assume that the component decoders are symmetric, then it suffices to consider the case where the all-zero codeword is transmitted over a BSC with error probability $p$. Let $x^{(\ell)}$ be the error probability of the hard-decision messages passed from the variable nodes to the constraint nodes after $\ell$ iterations. For an arbitrary symmetric decoder, let $P_n(i)$ be the probability that a randomly chosen bit is decoded incorrectly when it is initially incorrect and there are $i$ random errors in the other $n-1$ positions. Likewise, let $Q_n(i)$ be the probability that a randomly chosen bit is decoded incorrectly when it is initially correct and there are $i$ random errors in the other $n-1$ positions. Then, for the $(\mathcal{C}, m)$ GLDPC ensemble, the DE recursion implied by (1) is defined by $x^{(\ell+1)} = f_n(x^{(\ell)}; p)$ with $\bar{p} = 1 - p$ and

\[ f_n(x; p) \triangleq \sum_{i=0}^{n-1} \binom{n-1}{i} x^i (1-x)^{n-i-1} \left( p P_n(i) + \bar{p} Q_n(i) \right). \tag{2} \]
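
The one-dimensional recursion is straightforward to iterate numerically. The following is a minimal sketch, assuming the tables $P_n(i)$ and $Q_n(i)$ are supplied as lists. For illustration only, it plugs in a hypothetical miscorrection-free decoder ($P_n(i) = 1$ for $i \ge t$, $Q_n(i) = 0$) rather than the values of any particular code; the names `f_n` and `run_de` are not from the paper.

```python
from math import comb

def f_n(x, p, P, Q):
    """DE map: sum_i C(n-1,i) x^i (1-x)^(n-1-i) (p P[i] + (1-p) Q[i])."""
    n = len(P)
    return sum(comb(n - 1, i) * x**i * (1 - x)**(n - 1 - i)
               * (p * P[i] + (1 - p) * Q[i]) for i in range(n))

def run_de(p, P, Q, iters=200):
    x = p                      # messages start at the channel error rate
    for _ in range(iters):
        x = f_n(x, p, P, Q)
    return x

# Hypothetical miscorrection-free decoder: a wrong bit stays wrong only if
# at least t of the other n-1 inputs to its component code are also wrong.
n, t = 255, 3
P = [0.0 if i < t else 1.0 for i in range(n)]
Q = [0.0] * n

x_low = run_de(0.005, P, Q)    # channel well below the threshold
x_high = run_de(0.05, P, Q)    # channel well above the threshold
```

In this toy setting the error probability collapses to zero for the smaller channel parameter and stays close to $p$ for the larger one, which is the threshold behaviour that the recursion is meant to expose.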

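The quantities $P_n(i)$ and $Q_n(i)$ can be written in terms of the number of codewords, $A_l$, of weight $l$ in $\mathcal{C}$. Since the decoding regions of each codeword are disjoint, one finds that, for $t \le i \le n-1$,

\[ P_n(i) = 1 - \sum_{\delta=0}^{t} \sum_{l=i+1-\delta}^{i+1+\delta} \frac{n-l}{n} A_l \frac{\binom{l}{\frac{l-i+\delta-1}{2}} \binom{n-l-1}{\frac{i-l+\delta-1}{2}}}{\binom{n-1}{i}}, \tag{3} \]

and $P_n(i) = 0$ for $0 \le i \le t-1$. Similarly, one finds that

\[ Q_n(i) = \sum_{\delta=0}^{t} \sum_{l=i-\delta}^{i+\delta} \frac{l}{n} A_l \frac{\binom{l-1}{\frac{l-i+\delta-2}{2}} \binom{n-l}{\frac{i-l+\delta}{2}}}{\binom{n-1}{i}}, \tag{4} \]

for $t+1 \le i \le n-1$, and $Q_n(i) = 0$ for $0 \le i \le t$. Here a binomial coefficient is taken to be zero when its lower index is negative or not an integer, and $A_l = 0$ for $l > n$.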
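
The counting behind $P_n(i)$ and $Q_n(i)$ can be sketched directly from their definitions. The code below is an illustration derived from those definitions, not a transcription of the paper's formulas. It counts, for each distance $\delta \le t$ and codeword weight $l$, the received words that decode to a codeword which is correct (for $P_n$) or incorrect (for $Q_n$) in the chosen position, averaging over the position so that a fraction $(n-l)/n$ or $l/n$ of the weight-$l$ codewords qualifies. The function names are hypothetical.

```python
from math import comb

def _binom_half(a, twice_b):
    """C(a, twice_b / 2), taken as zero for odd, negative or oversized index."""
    if a < 0 or twice_b < 0 or twice_b % 2 or twice_b // 2 > a:
        return 0
    return comb(a, twice_b // 2)

def miscorrection_probs(A, t):
    """Return (P, Q) with P[i] = P_n(i) and Q[i] = Q_n(i) for i = 0..n-1.

    A[l] is the number of codewords of weight l, for l = 0..n.
    """
    n = len(A) - 1
    P, Q = [], []
    for i in range(n):
        total = n * comb(n - 1, i)
        good = bad = 0
        for delta in range(1, t + 1):
            for l in range(n + 1):
                # Codewords with a 0 in the chosen position at distance delta
                # from a weight-(i+1) word that has a 1 there.
                good += ((n - l) * A[l]
                         * _binom_half(l, l - i + delta - 1)
                         * _binom_half(n - l - 1, i - l + delta - 1))
                # Codewords with a 1 in the chosen position at distance delta
                # from a weight-i word that has a 0 there.
                bad += (l * A[l]
                        * _binom_half(l - 1, l - i + delta - 2)
                        * _binom_half(n - l, i - l + delta))
        P.append(1 - good / total)
        Q.append(bad / total)
    return P, Q

# Weight spectrum of the (7,4) Hamming code, used here with t = 1.
P7, Q7 = miscorrection_probs([1, 0, 0, 7, 7, 0, 0, 1], t=1)
```

For the (7,4) Hamming code this gives $P_7(3) = 0.8$ and $Q_7(2) = 0.2$, which can be confirmed by enumerating the 20 and 15 relevant received words by hand.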

Similar to DE for LDPC codes on the BEC [14, pp. 95-96], there is a compact characterization of the hard-decision decoding threshold $p^*$. The idea is that the successful decoding condition $f_n(x; p) < x$ provides a natural lower bound on the noise threshold that can be rewritten as $p \left[ f_n(x; 1) - f_n(x; 0) \right] + f_n(x; 0) < x$ to give

\[ p^* = \inf_{x \in (0,1)} \frac{x - f_n(x; 0)}{f_n(x; 1) - f_n(x; 0)}. \]

It is also worth noting that essentially the same recursion can be used for a BEC with erasure probability $p$. In this case, $Q_n(i) = 0$ and one redefines $P_n(i)$ to be the probability that a randomly chosen bit is not recovered when it is initially erased and there are $i$ random erasures in the other $n - 1$ positions.
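
Since $f_n(x; p)$ is affine in $p$, the threshold expression can be evaluated by a simple scan over $x$. The sketch below approximates the infimum on a uniform grid; the grid search, the function name `hdd_threshold` and the miscorrection-free tables used in the example are assumptions made for illustration.

```python
from math import comb

def f_n(x, p, P, Q):
    """DE map: sum_i C(n-1,i) x^i (1-x)^(n-1-i) (p P[i] + (1-p) Q[i])."""
    n = len(P)
    return sum(comb(n - 1, i) * x**i * (1 - x)**(n - 1 - i)
               * (p * P[i] + (1 - p) * Q[i]) for i in range(n))

def hdd_threshold(P, Q, grid=2000):
    """Grid approximation of inf_x (x - f(x;0)) / (f(x;1) - f(x;0))."""
    best = 1.0
    for k in range(1, grid):
        x = k / grid
        f0 = f_n(x, 0.0, P, Q)
        f1 = f_n(x, 1.0, P, Q)
        if f1 > f0:
            best = min(best, (x - f0) / (f1 - f0))
    return best

# Hypothetical miscorrection-free component decoder with n = 63, t = 2.
n, t = 63, 2
P = [0.0 if i < t else 1.0 for i in range(n)]
Q = [0.0] * n
p_star = hdd_threshold(P, Q, grid=500)
```

Running the recursion slightly below and slightly above `p_star` is a quick consistency check: the first run should converge to zero and the second should stall at a nonzero fixed point.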

III. Spatially-Coupled Ensemble 

Now, we consider a spatially-coupled GLDPC ensemble 
where every bit node satisfies two code constraints defined by 
C. Similar to the definition introduced in [15], the spatially- 
coupled GLDPC ensemble (C, m, L, w) is defined as follows. 

Definition 2. The $(\mathcal{C}, m, L, w)$ spatially-coupled GLDPC ensemble can be constructed from the underlying Tanner graph with $n$ degree-2 bit nodes and 2 degree-$n$ code-constraint nodes defined by $\mathcal{C}$. The Tanner graph of a spatially-coupled GLDPC code contains $L$ positions of bit nodes and $L + w - 1$ positions of code-constraint nodes. At each position, there are $N = \frac{mn}{2}$ degree-2 bit nodes and $m$ degree-$n$ code-constraint nodes. The $(\mathcal{C}, m, L, w)$ spatially-coupled GLDPC ensemble is constructed so that (i) the 2 edges from a bit node at position $i$ are uniformly and independently connected to code-constraint nodes at positions $[i, \ldots, i + w - 1]$ and (ii) the $n$ edges of a code-constraint node at position $i$ are independently connected to bit nodes at positions $[i - w + 1, \ldots, i]$.

A. Density Evolution 

To derive the DE update equation of the $(\mathcal{C}, m, L, w)$ spatially-coupled GLDPC ensemble, we let $x_i^{(\ell)}$ be the average error probability of hard-decision messages emitted by bit nodes at position $i$ after the $\ell$-th iteration. By the second property of the $(\mathcal{C}, m, L, w)$ spatially-coupled GLDPC ensemble, the average error probability of all inputs to a code-constraint node at position $i$ is $\tilde{x}_i^{(\ell)} = \frac{1}{w} \sum_{j=0}^{w-1} x_{i-j}^{(\ell)}$. From the first property, it follows that $x_i^{(\ell+1)} = \frac{1}{w} \sum_{k=0}^{w-1} f_n\left(\tilde{x}_{i+k}^{(\ell)}; p\right)$, where $f_n(x; p)$ is defined in (2) and $x_i^{(\ell)} = 0$ for $i \notin [1, L]$. Therefore, the DE update for this ensemble is given by

\[ x_i^{(\ell+1)} = \frac{1}{w} \sum_{k=0}^{w-1} f_n\left( \frac{1}{w} \sum_{j=0}^{w-1} x_{i-j+k}^{(\ell)}; p \right). \tag{5} \]
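
The coupled update averages twice: once over the $w$ bit positions feeding each constraint position, and once over the $w$ constraint positions feeding each bit position. A minimal sketch follows, assuming 0-based positions and a hypothetical miscorrection-free component decoder for the example; `coupled_de_step` is an illustrative name.

```python
from math import comb

def f_n(x, p, P, Q):
    """DE map: sum_i C(n-1,i) x^i (1-x)^(n-1-i) (p P[i] + (1-p) Q[i])."""
    n = len(P)
    return sum(comb(n - 1, i) * x**i * (1 - x)**(n - 1 - i)
               * (p * P[i] + (1 - p) * Q[i]) for i in range(n))

def coupled_de_step(x, p, P, Q, w):
    """One coupled DE update; x[i] is bit position i, zero outside 0..L-1."""
    L = len(x)
    def get(i):
        return x[i] if 0 <= i < L else 0.0
    # Average input to each of the L + w - 1 constraint positions.
    avg = [sum(get(c - j) for j in range(w)) / w for c in range(L + w - 1)]
    g = [f_n(a, p, P, Q) for a in avg]
    # Bit position i listens to constraint positions i .. i + w - 1.
    return [sum(g[i + k] for k in range(w)) / w for i in range(L)]

def run_coupled(p, P, Q, L=20, w=3, iters=100):
    x = [p] * L
    for _ in range(iters):
        x = coupled_de_step(x, p, P, Q, w)
    return x

# Hypothetical miscorrection-free component decoder with n = 63, t = 2.
n, t = 63, 2
P = [0.0 if i < t else 1.0 for i in range(n)]
Q = [0.0] * n
profile_low = run_coupled(0.02, P, Q)
profile_high = run_coupled(0.2, P, Q)
```

The boundary positions see constraints that are partly fed by known zeros, so they decode first; whether this advantage propagates inward depends on $p$, which is what a scan over $p$ with this routine would reveal.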

IV. BCH Component Codes 

In the remainder of this paper, the $(n, k, 2t+1)$ binary primitive BCH code as well as its even-weight $(n, k-1, 2t+2)$ subcode will be used as the component codes for both the $(\mathcal{C}, m)$ GLDPC and $(\mathcal{C}, m, L, w)$ spatially-coupled GLDPC ensembles. When the exact weight spectrum is known, one can use it to compute $P_n(i)$ and $Q_n(i)$ by (3) and (4), respectively. Otherwise, we use the following binomial approximation for binary primitive BCH codes with large $n$ [16],

\[ A_l \approx \begin{cases} 2^{-\nu t} \binom{n}{l} & \text{if } d \le l \le n - d \\ 1 & \text{if } l = 0,\ l = n \\ 0 & \text{otherwise,} \end{cases} \tag{6} \]

where $n = 2^\nu - 1$. Given this binomial approximation, we are able to simplify $P_n(i)$ and $Q_n(i)$ based on similar calculations in [16, Appendix A]. The approximations $\hat{P}_n(i)$ and $\hat{Q}_n(i)$ of $P_n(i)$ and $Q_n(i)$ for a primitive BCH code are given by the simplified expressions


[Equation (7): simplified expression for $\hat{P}_n(i)$; not legible in the source.]

and

[Equation (8): simplified expression for $\hat{Q}_n(i)$; not legible in the source.]

By substituting (7) and (8) into (2), one can get the DE recursion of the $(\mathcal{C}, m)$ GLDPC ensemble. Likewise, substituting (7) and (8) into (5), we have the DE recursion of the $(\mathcal{C}, m, L, w)$ spatially-coupled GLDPC ensemble.

For the $(n, k-1, 2t+2)$ even-weight subcode of an $(n, k, 2t+1)$ primitive BCH code, we approximate the number of codewords, denoted by $\tilde{A}_l$, by (6) when $l$ is even, and set $\tilde{A}_l = 0$ when $l$ is odd. Let $\tilde{P}_n(i)$ and $\tilde{Q}_n(i)$ be the miscorrection probabilities corresponding to the even-weight BCH subcode. Then, the DE recursions for the $(\mathcal{C}, m)$ GLDPC ensemble and the $(\mathcal{C}, m, L, w)$ spatially-coupled GLDPC ensemble can be obtained by using (2) and (5), respectively.

A. High-Rate Scaling 

In [13], Justesen has analyzed the asymptotic performance
of long product codes under the assumption that the decoder 
has no miscorrection. Using the random graph argument, 
a recursion for the "Poisson parameter" is obtained. That 
recursion leads to a threshold, for successful decoding, on 
the average number of errors in each code constraint. In this 
section, we obtain a similar recursion as the scaled limit, as 
$n \to \infty$, of our DE analysis. The novel contribution is that
this approach rigorously accounts for miscorrection. We first 
introduce a few lemmas that simplify the development. 

Lemma 3. $\hat{P}_n(i) = 0$ for $0 \le i \le t-1$ and the upper bound $\hat{P}_n(i) \le 1$ for $t \le i \le n-1$ is sharp as $n \to \infty$.

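Lemma 4. $\hat{Q}_n(i) = 0$ for $0 \le i \le t$. For any $\epsilon > 0$, there exists an $N > 0$ such that, for all $n > N$ and any $\beta \in (0, 1)$, it follows for $0 \le i \le \lfloor \beta(n-1) \rfloor$ that

[First upper bound on the miscorrection probability, with denominators involving $n$, $(1-\beta)$ and $(t-1)!$; the exact expression is not legible in the source.]

and

[Second upper bound, with separate cases for $i + t$ odd and $i + t$ even and denominators involving $n$, $(n+1)$, $(1-\beta)$, $(t-2)!$ and $(t-1)!$; the exact expression is not legible in the source.]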



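Consider the DE recursion for the $(\mathcal{C}, m)$ GLDPC ensemble based on (2). Let $p = \frac{\lambda^{(0)}}{n-1}$ with fixed $\lambda^{(0)}$, so that $p$ scales with $n$, and let $\lambda_n^{(\ell)} = (n-1) x^{(\ell)}$. The scaled recursion for $\lambda_n^{(\ell)}$ is given by

\[ \lambda_n^{(\ell+1)} = (n-1) f_n\left( \frac{\lambda_n^{(\ell)}}{n-1}; \frac{\lambda^{(0)}}{n-1} \right), \]

where $\lambda_n^{(0)} = \lambda^{(0)}$. Since $P_n(i) = Q_n(i) = 0$ for $0 \le i \le t-1$ and Lemma 3 shows $P_n(i) \to 1$ for $t \le i \le n-1$, we find that

\[ \lambda_n^{(\ell+1)} \le \sum_{i=t}^{n-1} \binom{n-1}{i} \left( \frac{\lambda_n^{(\ell)}}{n-1} \right)^{i} \left( 1 - \frac{\lambda_n^{(\ell)}}{n-1} \right)^{n-i-1} \left( \lambda^{(0)} \left(1 - Q_n(i)\right) + (n-1) Q_n(i) \right). \tag{9} \]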



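Lemma 5. Denote the tail probability of the Poisson distribution with mean $\lambda$ by $\phi(\lambda; t) = \sum_{i=t+1}^{\infty} \frac{\lambda^i}{i!} e^{-\lambda} = 1 - \sum_{i=0}^{t} \frac{\lambda^i}{i!} e^{-\lambda}$. For $\lambda > 0$, define $n' = n - 1$ and $\alpha_n = \lambda / n'$. Then,

\[ \lim_{n \to \infty} \sum_{i=t+1}^{n'} \binom{n'}{i} \alpha_n^i (1 - \alpha_n)^{n'-i} \hat{Q}_n(i) = 0 \tag{10} \]

and

\[ \lim_{n \to \infty} \sum_{i=t+1}^{n'} \binom{n'}{i} \alpha_n^i (1 - \alpha_n)^{n'-i} \, n \hat{Q}_n(i) \le \frac{\phi(\lambda; t)}{(t-1)!}, \tag{11} \]

where $\hat{Q}_n(i)$ is defined in (8).

Sketch of proof: For any finite $n$, the sums in (10) and (11) can be separated into two parts, a summation over $[t+1, \lfloor \beta n' \rfloor]$ and a summation over $[\lfloor \beta n' \rfloor + 1, n']$, where $\beta \in (0, 1)$. By applying the results in Lemma 4, both (10) and (11) can be obtained. ■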
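Define $\lambda^{(\ell)} = \limsup_{n \to \infty} \lambda_n^{(\ell)}$. Substituting the approximation $\hat{Q}_n(i)$ for $Q_n(i)$ in (9) and applying Lemma 5 allows us to obtain

\[ \lambda^{(\ell+1)} = \lambda^{(0)} \phi\left(\lambda^{(\ell)}; t-1\right) + \frac{1}{(t-1)!} \phi\left(\lambda^{(\ell)}; t\right). \tag{12} \]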
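
The scaled recursion depends only on $t$ and the initial value $\lambda^{(0)}$, so its threshold can be located by bisection on $\lambda^{(0)}$. The sketch below assumes the recursion has the two-term form "first term $\lambda^{(0)} \phi(\lambda; t-1)$ plus miscorrection term $\phi(\lambda; t)/(t-1)!$"; the convergence test, iteration cap and function names are illustrative choices rather than part of the paper.

```python
from math import exp, factorial

def poisson_tail(lam, t):
    """phi(lam; t) = P[Poisson(lam) > t]."""
    term, cdf = exp(-lam), 0.0
    for i in range(t + 1):
        cdf += term
        term *= lam / (i + 1)
    return max(0.0, 1.0 - cdf)

def scaled_step(lam, lam0, t):
    """One step of the scaled recursion, including the miscorrection term."""
    return (lam0 * poisson_tail(lam, t - 1)
            + poisson_tail(lam, t) / factorial(t - 1))

def converges(lam0, t, iters=5000, tol=1e-9):
    lam = lam0
    for _ in range(iters):
        lam = scaled_step(lam, lam0, t)
        if lam < tol:
            return True
    return False

def scaled_threshold(t, steps=40):
    """Bisection for the largest lam0 at which the recursion reaches zero."""
    lo, hi = 0.0, 2.0 * t + 2.0
    for _ in range(steps):
        mid = (lo + hi) / 2
        if converges(mid, t):
            lo = mid
        else:
            hi = mid
    return lo

threshold_t3 = scaled_threshold(3)
```

Dropping the second term of `scaled_step` gives the miscorrection-free recursion, which makes it easy to see numerically how much of the threshold loss is due to miscorrection for a given $t$.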



Remark 6. For any $n < \infty$, $\lambda^{(0)}$ can be seen as the average number of initial error bits attached to a code-constraint node, and $\lambda_n^{(\ell)}$ can be considered as the average number of error bits per code-constraint node after the $\ell$-th iteration. From (9) and the definition $\lambda^{(\ell)} = \limsup_{n \to \infty} \lambda_n^{(\ell)}$, it follows that the scaled recursion (12) tracks the upper bound of the average number of error bits per code-constraint node with iteration. Also, while the derivation of (12) involves an approximation before the limit, we believe that the final expression is exact.

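When the even-weight BCH subcode is used as the component code, we can have an upper bound of $\tilde{Q}_n(i)$ similar to (11). Let

\[ \psi(\lambda; t) = \frac{1 + e^{-2\lambda}}{2} - \sum_{i=0}^{\lfloor t/2 \rfloor} \frac{\lambda^{2i}}{(2i)!} e^{-\lambda} \]

be the tail probability for the even terms of a Poisson distribution with mean $\lambda$ and started at $t+1$, and

\[ \varphi(\lambda; t) = \frac{1 - e^{-2\lambda}}{2} - \sum_{i=0}^{\lfloor (t-1)/2 \rfloor} \frac{\lambda^{2i+1}}{(2i+1)!} e^{-\lambda} \]

be the tail probability for the odd terms of a Poisson distribution with mean $\lambda$ and started at $t+1$. With the same definitions of $\lambda$, $\alpha_n$ and $n'$ in Lemma 5, it can be shown that

\[ \lim_{n \to \infty} \sum_{i=t+1}^{n'} \binom{n'}{i} \alpha_n^i (1 - \alpha_n)^{n'-i} \, n \tilde{Q}_n(i) \le \begin{cases} \frac{1}{(t-1)!} \psi(\lambda; t) & \text{if } t \text{ is even,} \\ \frac{1}{(t-1)!} \varphi(\lambda; t) & \text{otherwise.} \end{cases} \]

Then, the DE recursion of the GLDPC ensemble with the even-weight BCH subcode can be obtained by using (12) as well. When $t$ is even, replace $\phi(\lambda; t)$ in (12) with $\psi(\lambda; t)$; otherwise, replace $\phi(\lambda; t)$ with $\varphi(\lambda; t)$.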

For the spatially-coupled GLDPC ensemble, let $\lambda^{(0)}$ be the average number of initial error bits per component code, and $\lambda_i^{(\ell)}$ be the average number of errors per component code at position $i$ after the $\ell$-th iteration. Define $\tilde{\lambda}_i^{(\ell)} = \frac{1}{w} \sum_{j=0}^{w-1} \lambda_{i-j}^{(\ell)}$ as the averaged number of error inputs to a code-constraint node at position $i$. Similar to (12), the recursion for the spatially-coupled ensemble is

\[ \lambda_i^{(\ell+1)} = \frac{1}{w} \sum_{j=0}^{w-1} \left( \lambda^{(0)} \phi\left(\tilde{\lambda}_{i+j}^{(\ell)}; t-1\right) + \frac{1}{(t-1)!} \phi\left(\tilde{\lambda}_{i+j}^{(\ell)}; t\right) \right). \tag{13} \]

Also, replacing $\phi(\lambda; t)$ in (13) with $\psi(\lambda; t)$ or $\varphi(\lambda; t)$ according to the parity of $t$ gives us the DE recursion of the spatially-coupled ensemble with even-weight BCH subcodes. Finally, we observe that the second term in (13) is upper bounded by $\frac{1}{(t-1)!}$ times the first term. This implies that miscorrection has a negligible effect on the threshold as $t \to \infty$.
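
The coupled scaled recursion can be simulated position by position. The sketch below assumes 0-based positions, zero boundary values outside the chain, and the same two-term update as in the uncoupled scaled recursion; the chain length, window size and function names are small illustrative choices, not the $L = 512$, $w = 16$ setting used for the numerical results.

```python
from math import exp, factorial

def poisson_tail(lam, t):
    """phi(lam; t) = P[Poisson(lam) > t]."""
    term, cdf = exp(-lam), 0.0
    for i in range(t + 1):
        cdf += term
        term *= lam / (i + 1)
    return max(0.0, 1.0 - cdf)

def coupled_scaled_step(lams, lam0, t, w):
    """One coupled update; lams[i] is position i, zero outside 0..L-1."""
    L = len(lams)
    def get(i):
        return lams[i] if 0 <= i < L else 0.0
    # Averaged input at each of the L + w - 1 constraint positions.
    avg = [sum(get(c - j) for j in range(w)) / w for c in range(L + w - 1)]
    g = [lam0 * poisson_tail(a, t - 1)
         + poisson_tail(a, t) / factorial(t - 1) for a in avg]
    return [sum(g[i + j] for j in range(w)) / w for i in range(L)]

def run_coupled(lam0, t, L=30, w=3, iters=200):
    lams = [lam0] * L
    for _ in range(iters):
        lams = coupled_scaled_step(lams, lam0, t, w)
    return lams

low = run_coupled(3.0, 3)    # well below the uncoupled threshold
high = run_coupled(8.0, 3)   # well above 2t
```

Scanning $\lambda^{(0)}$ between these two extremes, with a longer chain and more iterations, is one way to estimate the coupled threshold for a given $t$ and $w$.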

V. EXIT Function Bounds 

For coding systems based on iterative decoding, one can 
ask if the gap to capacity is due to the code or to the decoder. 
Fortunately, this question can now be tackled using the theory 
of extrinsic information (EXIT) functions. For example, if we 
ignore the effect of miscorrection and consider the natural 
hard-decision peeling decoder for the (C, m) ensemble based 
on BCH codes, then it is easy to see that at most mt errors can 
be corrected using BDD. To achieve this result, it must happen 
that each code corrects exactly t errors. If some codes decode 
with fewer than t errors, then there is an irreversible loss of 
error-correcting potential. Since there are $\frac{n}{2}$ code bits per
code constraint, normalizing this number shows that at most 
2t errors per code constraint are correctable. 

In this section, we apply the theory of EXIT functions for the BEC to understand the HDD performance of our spatially-coupled GLDPC ensemble. The idea is that the DE recursion, when miscorrection is ignored, is equivalent to the DE recursion for the erasure channel with component codes that correct $t$ erasures. For this model, we consider the scaled DE recursion without miscorrection, $\lambda^{(\ell+1)} = \lambda^{(0)} \phi\left(\lambda^{(\ell)}; t-1\right)$.

One can use EXIT functions to rigorously compute the MAP threshold for such a system on the erasure channel. We conjecture that the spatially-coupled version of this ensemble will have a BEC iterative decoding threshold that equals this MAP threshold. Based on this, we can predict the threshold of spatially-coupled HDD decoding, in the absence of miscorrection, because the peeling decoders for the two problems (i.e., BEC and HDD on BSC) have the same threshold. Taking this approach, we use the scaled BEC-MAP EXIT function $h^{\mathrm{BMAP}}(\lambda) = \left(\phi(\lambda; t-1)\right)^2$ to compute (e.g., see [14]) the scaled trial entropy

\[ P(\lambda) = \int_0^{\lambda} h^{\mathrm{BMAP}}(z) \, \frac{d}{dz}\left( \frac{z}{\phi(z; t-1)} \right) dz = -\lambda \phi(\lambda; t-1) + 2 \int_0^{\lambda} \phi(z; t-1) \, dz. \]

Let $\lambda^{\mathrm{BMAP}}$ be the unique positive root of $P(\lambda)$. Then, we conjecture that the BEC MAP threshold without miscorrection, $p^{\mathrm{BMAP}} = \lambda^{\mathrm{BMAP}} / \phi\left(\lambda^{\mathrm{BMAP}}; t-1\right)$, is also the threshold of the spatially-coupled HDD decoder. One can use algebra to prove the following lemma.

Lemma 7. There is a $T < \infty$ such that the BEC MAP threshold $p^{\mathrm{BMAP}} \ge 2t - 2$ for all $t \ge T$.
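
The trial entropy has a closed form because $\int_0^{\lambda} \phi(z; k)\,dz = \lambda - \sum_{j=0}^{k} \phi(\lambda; j)$, so the MAP bound reduces to a one-dimensional root search. The following sketch assumes $t \ge 2$ and uses plain bisection; the bracketing interval and the function names are illustrative assumptions.

```python
from math import exp

def poisson_tail(lam, t):
    """phi(lam; t) = P[Poisson(lam) > t]."""
    term, cdf = exp(-lam), 0.0
    for i in range(t + 1):
        cdf += term
        term *= lam / (i + 1)
    return max(0.0, 1.0 - cdf)

def trial_entropy(lam, t):
    """P(lam) = -lam * phi(lam; t-1) + 2 * int_0^lam phi(z; t-1) dz."""
    integral = lam - sum(poisson_tail(lam, j) for j in range(t))
    return -lam * poisson_tail(lam, t - 1) + 2.0 * integral

def bmap_threshold(t, iters=100):
    """Root of the trial entropy by bisection, mapped to the threshold."""
    lo, hi = 0.5, 4.0 * t + 4.0   # P(lo) < 0 < P(hi) when t >= 2
    for _ in range(iters):
        mid = (lo + hi) / 2
        if trial_entropy(mid, t) < 0:
            lo = mid
        else:
            hi = mid
    lam = (lo + hi) / 2
    return lam / poisson_tail(lam, t - 1)

bmap_t3 = bmap_threshold(3)
```

For $t = 3$ the root is near $\lambda \approx 5.07$ and the resulting threshold is approximately 5.75, in line with the last row of Table I.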

VI. Numerical Results and Comparison 

We first introduce the definition of the $\epsilon$-redundancy optimality of a code ensemble.

Definition 8. Let $C(p)$ be the capacity of a BSC($p$). For any $\epsilon > 0$, a code ensemble with rate $R$ and threshold $p^*$ is called $\epsilon$-redundancy achieving if

\[ \frac{1 - C(p^*)}{1 - R} \ge 1 - \epsilon. \]

Let $n_\nu = 2^\nu - 1$. The following lemma shows that a sequence of ensembles, parameterized by $\nu \in \mathbb{Z}_+$, with rate $R_\nu = 1 - 2\nu t n_\nu^{-1}$ and threshold $p_\nu^* = 2t n_\nu^{-1}$ is $\epsilon$-redundancy achieving when $\nu$ is large.

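Lemma 9. Consider a sequence of BSCs with error probability $2t n_\nu^{-1}$ for fixed $t$ and increasing $\nu$. Then, the ratio of $1 - C\left(2t n_\nu^{-1}\right)$ and $2t\nu n_\nu^{-1}$ goes to 1 as $\nu \to \infty$. That is,

\[ \lim_{\nu \to \infty} \frac{1 - C\left(2t n_\nu^{-1}\right)}{2t\nu n_\nu^{-1}} = 1. \tag{14} \]

Proof: Recall that the capacity of the BSC($p$) is $1 - H(p)$, where $H(p) = -p \log_2(p) - (1-p) \log_2(1-p)$ is the binary entropy function. The numerator of (14) can be written as

\[ 1 - C\left(\frac{2t}{n_\nu}\right) = H\left(\frac{2t}{n_\nu}\right) = \frac{2t}{n_\nu} \log_2(n_\nu) \left( 1 - \frac{\log_2\left(\frac{2t}{e}\right)}{\log_2 n_\nu} + O\left(n_\nu^{-1}\right) \right). \tag{15} \]

By substituting (15) into the ratio in (14), we have

\[ \frac{1 - C\left(\frac{2t}{n_\nu}\right)}{2t\nu n_\nu^{-1}} = \frac{2t n_\nu^{-1} \log_2(n_\nu)}{2t\nu n_\nu^{-1}} \left( 1 - O\left(\nu^{-1}\right) \right). \]

Then, the equality (14) follows since $\log_2(n_\nu) = \nu + o(1)$. ■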
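
The limit can also be checked numerically. The sketch below evaluates the ratio of the channel's entropy at $p = 2t/n_\nu$ to the ensemble redundancy $2t\nu/n_\nu$ for a few values of $\nu$; the function name `redundancy_ratio` is an illustrative assumption.

```python
from math import log, log1p, log2

def redundancy_ratio(t, nu):
    """(1 - C(2t/n)) / (2*t*nu/n) with n = 2**nu - 1 and C(p) = 1 - H(p)."""
    n = 2**nu - 1
    p = 2 * t / n
    # Binary entropy; log1p keeps the (1 - p) term accurate for tiny p.
    h = -p * log2(p) - (1 - p) * log1p(-p) / log(2)
    return h / (2 * t * nu / n)

ratios = [redundancy_ratio(3, nu) for nu in (10, 20, 40)]
```

The ratio increases toward 1 from below, but only at the rate $O(\nu^{-1})$ indicated in the proof, so large block lengths are needed before the gap becomes small.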
In the following numerical results, the iterative HDD thresholds of the $(\mathcal{C}, m, L, w)$ spatially-coupled GLDPC ensemble with $L = 512$ and $w = 16$ are considered. In Table I, the thresholds of the ensembles are shown by computing the average number of error bits attached to a code-constraint node. For finite $n$, let $\hat{p}^*_{n,t}$ be the iterative HDD threshold of the spatially-coupled GLDPC ensemble with the $(n, k, 2t+1)$ binary primitive BCH component code, and $\tilde{p}^*_{n,t}$ be the iterative HDD threshold of the spatially-coupled GLDPC ensemble with $(n, k-1, 2t+2)$ even-weight BCH subcodes. Then, we define $\hat{a}^*_{n,t} = n \hat{p}^*_{n,t}$ and $\tilde{a}^*_{n,t} = n \tilde{p}^*_{n,t}$ as the thresholds of the average number of error bits per component code. When $n \to \infty$, we denote $\hat{\lambda}^*_t$ and $\tilde{\lambda}^*_t$ as the iterative HDD thresholds of the ensembles with the primitive BCH component code and its even-weight subcode, respectively. Moreover, the threshold of HDD without miscorrection, $\bar{\lambda}^*_t$, is shown in Table I along with the EXIT upper bound on the BEC MAP threshold, $p^{\mathrm{BMAP}}$.

As shown in Table I, the thresholds of the spatially-coupled ensemble with the primitive BCH codes or the even-weight BCH subcodes are quite close to $2t$ as $t$ increases. The vanishing effect of miscorrection can be shown using Lemma 4. As $t$ increases, both $\hat{Q}_n(i)$ and $\tilde{Q}_n(i)$ approach 0. Also in Table I, one sees that $\hat{\lambda}^*_t$, $\tilde{\lambda}^*_t$ and $\bar{\lambda}^*_t$ for large $t$ are close to each other because the miscorrection probability becomes negligible. One can also observe that $\hat{\lambda}^*_t$, $\tilde{\lambda}^*_t$ and $\bar{\lambda}^*_t$ approach $2t$ as $t$ increases. Therefore, we propose the following conjecture.

Conjecture 10. For any $\epsilon > 0$, there is a $t$ where $\hat{\lambda}^*_t \ge 2t(1 - \epsilon)$.

Conjecture 10 implies that one can choose $t$ large enough and then $N$ large enough so that $\hat{p}^*_{n,t} \ge \frac{2t}{n}(1 - \epsilon)$ for all $n > N$.

Proposition 11. If Conjecture 10 holds, then, for any $\epsilon > 0$, there exist a $T(\epsilon)$ and an $N(\epsilon, T)$ such that, when the component codes are $t$-error correcting BCH codes with $t > T(\epsilon)$ and $n > N(\epsilon, T)$, the $(\mathcal{C}, m, L, w)$ spatially-coupled GLDPC ensemble with iterative HDD is $\epsilon$-redundancy achieving.

Sketch of proof: The proposition follows from Lemma 9 and the definition of scaled DE. ■



Table I
The iterative HDD thresholds of the $(\mathcal{C}, \infty, 512, 16)$ spatially-coupled GLDPC ensemble with binary primitive BCH codes and their even-weight subcodes

t                            3       4       5       6       7
$\hat{a}^*_{255,t}$          5.432   7.701   9.818   11.86   13.87
$\hat{a}^*_{511,t}$          5.417   7.665   9.811   11.86   13.85
$\hat{a}^*_{1023,t}$         5.401   7.693   9.821   11.87   13.88
$\hat{\lambda}^*_t$          5.390   7.688   9.822   11.91   13.93
$\tilde{a}^*_{255,t}$        5.610   7.752   9.843   11.88   13.87
$\tilde{a}^*_{511,t}$        5.570   7.767   9.811   11.86   13.85
$\tilde{a}^*_{1023,t}$       5.606   7.765   9.841   11.88   13.88
$\tilde{\lambda}^*_t$        5.605   7.761   9.840   11.90   13.92
$\bar{\lambda}^*_t$          5.735   7.813   9.855   11.91   13.93
$p^{\mathrm{BMAP}}$          5.754   7.843   9.896   11.93   13.95



VII. Conclusion 
The iterative HDD of GLDPC ensembles, based on t-error correcting block codes, is analyzed with and without spatial coupling. Using DE analysis, noise thresholds are computed for a variety of component codes and decoding assumptions. In particular, the case of binary primitive BCH component codes is considered along with their even-weight subcodes. For these codes, the miscorrection probability is characterized and included in the DE analysis. Scaled DE recursions are also computed for the high-rate limit. When miscorrection is neglected, the resulting recursion for the basic ensemble matches the results of [12], [13]. Upper bounds on the BEC MAP threshold of these codes are also computed and the phenomenon of threshold saturation is observed empirically. It is conjectured that threshold saturation does indeed occur for the scaled DE and, assuming this is true, it is shown that iterative hard-decision decoding (HDD) of the spatially-coupled GLDPC ensemble can approach capacity in the high-rate regime. Finally, numerical results are presented that both support this conjecture and demonstrate the effectiveness of these codes for high-speed communication systems.